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Abstract 

In this paper we address the issue of modeling electricity loads and prices with diffu- 
sion processes. More specifically, we study models which belong to the class of generalized 
Ornstein-Uhlenbeck processes. After comparing properties of simulated paths with those of 
deseasonalized data from the California power market and performing out-of-sample forecasts 
we conclude that, despite certain advantages, the analyzed continuous-time processes are not 
adequate models of electricity load and price dynamics. 
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1 Introduction 

The last decade has witnessed radical changes in the structure of electricity markets world-wide. 
Prior to the 1980s it was argued convincingly that the electricity industry was a natural monopoly 
and that strong vertical integration was an obvious and efficient model for the power sector. In 
the 1990s, technological advances suggested that it was possible to operate power generation and 
retail supply as competitive market segments jl| ||. 

The changes that are taking place and the growing complexity of today's energy markets in- 
troduce the need for sophisticated tools for the analysis of market structures and modeling of 
electricity load and price dynamics ^. However, we have to bear in mind that electricity 
markets are not anywhere near as straightforward as financial or even other commodity markets. 
Demand and supply are balanced on a knife-edge because electric power cannot be economically 
stored, end user demand is largely weather dependent, and the reliability of the grid is paramount. 

Recently it has been observed that, contrary to most financial assets ^, 0], electricity price 
processes are mean-reverting ^ ^, In the next Sections we investigate whether electricity 
prices and loads in the California power market can be modeled by generalized Ornstein-Uhlenbeck 
processes, a special class of mean-reverting diffusion processes. 

^Corresponding author. E-mail address: rweron@im.pwr.wroc.pl 
^Research partially supported by KBN Grant no. 8 TlOB 034 17. 
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Figure 1: CalPX market daily average clearing prices since April 1st, 1998 until December 31st, 
2000 {top panel). A typical week in late winter - CalPX market daily average clearing prices since 
February 28th, 2000 until March 5th, 2000 {bottom left panel). The daily cycle - low prices at 
night, high in the evening - CalPX market hourly clearing prices on February 28th, 2000 {bottom 
right panel). 



2 Preparation of the data 

The analyzed database was provided by the University of California Energy Institute (UCEI) ||ll| . 
Among others it contains market clearing prices from the California Power Exchange (CalPX) and 
system- wide loads supplied by California's Independent (Transmission) System Operator (ISO). 
At first we looked at CalPX clearing prices - a time series containing system prices of electricity 
for every hour since April 1st, 1998, 0:00 until December 31st, 2000, 24:00. Because the series 
included a very strong daily cycle we created a 1006 days long sequence of average daily prices (as 
in §), see Fig. 1. 

The price trajectory suggests that the process does not exhibit a regular annual cycle. Indeed, 
since June 2000, California's electricity market has produced extremely high prices and threats of 
supply shortages. The difficulties that have appeared are intrinsic to the design of the market, 
in which demand exhibits virtually no price responsiveness and supply faces strict production 
constraints p2| . It is evident that without taking into consideration regulatory issues, modeling 
electricity prices in the "unstable" California power market is an almost impossible task. Thus, 
instead of forecasting electricity prices, we tried to tackle the "simpler" problem of modeling system 
loads. 

Like for electricity prices, the UCEI database contains information about the system- wide load 
for every hour of the period April 1st, 1998 - December 31st, 2000. Due to a very strong daily 
cycle we have created a 1006 days long sequence of daily loads, which is plotted in Fig. 2. Apart 
from the daily cycle, the time series exhibits weekly and annual seasonality. Due to the fact that 
common trend and seasonality removal techniques do not work well when the time series is only 
a few (and not complete, in our case ca. 2.8 annual cycles) cycles long, we restricted the analysis 
only to two full years of data, i.e. to the period January 1st, 1999 - December 31st, 2000, and 
applied a new seasonality reduction technique. 

The seasonality can be easily observed in the frequency domain by plotting the periodogram, 
which is a sample analogue of the spectral density. For a vector of observations {xi, a;„} the peri- 
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Figure 2: California power market daily system-wide load since April 1st, 1998 until December 
31st, 2000. The annual and weekly seasonality are clearly visible. 



odogram is defined as /„(a-'fe) = ^ exp{— 27ri(< — l)u;fc}| , where Wk = k/n, k = 1, [n/2] 

and [x] denotes the largest integer less then or equal to x. Observe that /„ is the squared absolute 
value of the Fourier transform. In order to use fast algorithms for the Fourier transform we re- 
stricted ourselves to vectors of even length, i.e. n = 2m. In Figure 3 we plotted the periodogram for 
the system-wide load before and after removal of the weekly and annual cycles. The periodogram 
shows well-defined peaks at frequencies corresponding to cycles with period 7 and 365 days. The 
smaller peaks close to LOk — 0.3 and 0.4 indicate periods of 3.5 and 2.33 days, respectively. Both 
peaks are the so called harmonics (multiples of the 7-day period frequency) and indicate that the 
data exhibits a 7-day period but is not sinusoidal. The weekly period was also observed in lagged 
autocorrelation plots Q. These cycles have to be removed before further analysis is carried out, 
since they may influence predictions to a great extent. 

To remove the weekly cycle we used the moving average technique JlSf . For the vector of daily 
loads {xi , . . . , X730 } the trend was first estimated by applying a moving average filter specially chosen 
to eliminate the weekly component and to dampen the noise: rht = ^{xt-z + ■•■ +2:4-1-3)7 where 
t — 4, 727. Next, we estimated the seasonal component. For each k = 1, 7, the average Wk of 
the deviations {{xk+jj — rhk+rj), 3 < fc -|- 7j < 727} was computed. Since these average deviations 
do not necessarily sum to zero, we estimated the seasonal component Sk as Sk = Wk — j J2i=i "^ii 
where fc = 1, ...,7 and Sk — Sfc-7 for k > 7. The deseasonalized (with respect to the 7-day cycle) 
data was then defined St for t = 1, ...,730. Finally we removed the trend from the 

deseasonalized data {d*} by taking logarithmic returns, see the middle panel of Fig. 4. 

After removing weekly seasonality we were left with the annual cycle. Unfortunately, because 
of the short length of the time series (only two years), the method applied to the 7-day cycle could 
not be used to remove the annual cycle. To overcome this we introduced a new method which 
consists of the following: (i) calculate a 25-day rolling volatility flj] for the whole vector; (ii) 
calculate the average volatility for one year (i.e. in our case vt = {v^'^^ -\- w^°*'°)/2); (iii) smooth 
the volatility by taking a 25-day moving average of Vt] (iv) finally, rescale the returns by dividing 
them by the smoothed annual volatility. The obtained time series (see the bottom panel of Fig. 4) 
showed no apparent trend and seasonality (see the bottom panel of Fig. 3). Therefore we treated it 
as a stationary process. In the next Section we fit the deseasonalized load returns by a generalized 
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Figure 3: Periodogram of the California power market daily system-wide load since January 1st, 
1999 until December 31st, 2000 {top panel). The annual and weekly frequencies are clearly visible. 
Periodogram of the load after removal of the weekly cycle {middle panel) and of the load returns 
after removal of the weekly and annual cycles {bottom panel). In the last plot no dominating 
frequency can be observed. 
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Figure 4: California power market daily system-wide load returns since January 1st, 1999 until 

December 31st, 2000 {top panel). Load returns after the removal of the weekly cycle and the 25- 
day rolling volatility {middle panel). Load returns after removal of the weekly and annual cycles 
{bottom panel). 
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Figure 5: Partial autocorrelation function (PACF) of an AR(1) process (top left panel) and the 
deseasonalized system load logarithmic returns {top right panel). Deseasonalized system load log- 
arithmic returns in December 2000 and the Vasicek prediction {bottom panel). 



Ornstein-Uhlenbeck type model. 



3 Modeling with generalized Ornstein-Uhlenbeck type pro- 
cesses 

The deseasonalized data sets were modeled by mean-reverting continuous-type processes of the 
form (generalized Ornstein-Uhlenbeck processes): 

dXt = P{m - Xt)dt + pX^dBt. (1) 

Unfortunately, since we were unable to remove the annual cycle from the system loads themselves 
we had to restrict our analysis to models with 7 = (we estimated 7 £ (0.3, 0.8), but for fractional 
7 the process has to be strictly positive and evidently returns do not comply with this restriction). 
Thus we were left with the so-called Vasicek model . 

We can calibrate the Vasicek model via ordinary linear regression: 

Xt - E{Xt) +et = Xt-ie-^ + m(l - e"^) + eu (2) 

where e* ~ N{0, pe) and p^ is the standard deviation from the regression. Observe that the above 
implies that the Vasicek model is a continuous version of an AR(1) process. This is the main 
reason why it performs poorly for our data sets. The deseasonalized system loads may be an AR 
(Auto Regressive) process, however, of an order greater then 1 (see the PACF plots in Fig. 5, 
which can be used as an estimate of the AR order ||l^). It is worth noting that other diffusions 
of the form (|^) also have a very short AR dependence structure, which would probably result in 
poor prediction of electricity prices or loads. 

For comparison, in the bottom panel of Fig. 5 we plotted actual deseasonalized load returns 
in December 2000 and the Vasicek prediction. The prediction is a one day forecast with model 
parameters estimated from the last 365 daily returns. Unfortunately the fit is far from being 
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perfect. The largest differences occur during Christmas (December 24th-26th), but this can be 
improved by incorporating a hoHday structure into the modeL However, the prediction for the first 
23 days in December is still much worse than the prediction obtained from a simple ARMA(3,3) 
model i.e. the mean absolute deviation from the true values is 0.565 compared to 0.355 for 
the ARM A forecast. 

Even though continuous-time models have certain advantages (like analytic tractability, a de- 
veloped theory of pricing derivatives, etc. ^) over discrete models, further research will be in 
the direction of discrete time series models which offer a much better fit to market data . 
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